Introduction

When humans and robots work in cooperation with one another, it would
be helpful for the robots to be able to understand the humans' motions
and help appropriately. To make this possible, it is important to
perceive human motion in video images as sequences of individual
movements. A number of techniques have been proposed for the
recognition of human movements [1][2], and we ourselves have presented
a gesture-recognition method focusing on real-time applicability [3].
The features of our model are:

• the use of temporal edges as a feature, and

• the use of Continuous Dynamic Programming (CDP) for matching
against the models [4].

The temporal edges are obtained by passing the input from a CCD camera
through temporal differencing, spatial reduction, temporal averaging,
and logarithmic processing, producing what we call a "feature-image".
All of these processes are comparatively simple, and are thus well
suited to real-time recognition systems. CDP matching is one of a group
of methods known as "spotting" techniques, and is capable of producing
a result for each frame as it is processed. Also, because the amount of
matching time it requires is only proportional to the total number of
frames in a model, it is commonly used in the field of voice
recognition. For image recognition, however, the number of
feature-image distances that have to be calculated for every unit of
time is equal to the total number of frames in the models, and this
poses a problem when constructing systems capable of recognizing
large-scale models.

To address this problem, we investigated:

• clustering using the k-means method, and

• reducing computation time by parallel processing.
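The feature-image pipeline described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `feature_images`, the absolute frame difference, the block-mean reduction, the causal moving average, and the use of log(1 + x) are all assumptions, since the text names the four stages but not their exact form. The default sizes (16x16 output, three-frame averaging) are the values reported later in the paper.

```python
import numpy as np

def feature_images(frames, out_size=16, avg_frames=3):
    """Map a (T, H, W) grayscale video to (T - 1, out_size, out_size) feature-images."""
    frames = np.asarray(frames, dtype=np.float64)
    # Temporal differencing (assumed form): absolute change between consecutive frames.
    diff = np.abs(np.diff(frames, axis=0))
    # Spatial reduction (assumed form): mean over non-overlapping blocks.
    n, h, w = diff.shape
    bh, bw = h // out_size, w // out_size
    cropped = diff[:, :bh * out_size, :bw * out_size]
    reduced = cropped.reshape(n, out_size, bh, out_size, bw).mean(axis=(2, 4))
    # Temporal averaging (assumed form): causal mean over the last avg_frames frames.
    averaged = np.empty_like(reduced)
    for i in range(n):
        averaged[i] = reduced[max(0, i - avg_frames + 1):i + 1].mean(axis=0)
    # Logarithmic processing (assumed form): log(1 + x) compresses the dynamic range.
    return np.log1p(averaged)
```

Each stage is a fixed-cost pass over the image, which is consistent with the claim that the feature extraction is simple enough for real-time use.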

Clustering 

The diverse gestures made by human beings can be thought of as being
constructed from a set of simple elemental movements. It can be
effective, particularly for systems using large-scale models, to
classify these movements into categories, and then to express gestures
using category numbers. Also, if the number of categories is smaller
than the total number of model frames, this can be a useful technique
for improving speed, since the number of times the distance between
feature-images has to be calculated is reduced. We used the k-means
method to classify the feature-images into categories, and then
conducted recognition tests.

Table 1: Recognition-ratios Using Clustering (%)

Feature-image   32 clusters   64 clusters
98              83            95
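One way to picture the category-number representation is sketched below. It is an assumption-laden illustration, not the authors' code: the helper names (`nearest`, `kmeans`, `centroid_table`, `local_distances`) are hypothetical, the k-means is plain Lloyd iteration with random initialization, and the idea of reading local distances from a precomputed centroid-to-centroid table is our guess at how category numbers replace feature-image distances.

```python
import numpy as np

def nearest(x, centroids):
    """Index of the nearest centroid (squared Euclidean) for each row of x."""
    d2 = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centroids, category number per row)."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = nearest(x, centroids)
        for c in range(k):
            members = x[labels == c]
            if len(members):              # keep the old centroid if a cluster empties
                centroids[c] = members.mean(axis=0)
    return centroids, nearest(x, centroids)

def centroid_table(centroids):
    """Distances between all pairs of categories, computed once offline."""
    return np.linalg.norm(centroids[:, None, :] - centroids[None, :, :], axis=2)

def local_distances(input_vec, centroids, model_labels, table):
    """Categorize one input frame (k distances), then look up every model frame."""
    category = nearest(input_vec[None, :], centroids)[0]
    return table[category, model_labels]
```

Under this scheme the per-frame cost is k full distance calculations plus one table lookup per model frame, instead of one full distance calculation per model frame.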

Recognition Testing 

The gestures that served as the objects of recognition consisted of
six types of motions: (1) putting down a book, (2) opening the book,
(3) turning to the next page, (4) returning to the previous page,
(5) closing the book, and (6) holding the book. These were filmed from
seven different directions fifteen degrees apart, centered on the
position directly in front of the experimental subject, for a total of
42 models. We then checked the recognition ratio using the same human
subject, wearing the same clothes, against the same background, for
all 6 motions x 7 directions (Table 1). Our input-image size was
256x256, the feature-image size was 16x16, and the temporal-averaging
constant was three frames. This test showed that, provided an
appropriate number of clusters is selected, there is hardly any
difference from the case in which the distance calculations are made
using the feature-image directly.

Calculating Time 

Next, we compared the calculating time between the case in which the
feature-images were used and that in which 64 clusters were used
(Table 2). The input video was 495 frames long, and the total number
of frames in the standard-patterns was 967.

Table 2: Calculation Time (s)

                             Feature-image   64 clusters
Local-distance calculation   56.2            0.2
Categorization               0.0             4.2
Total                        56.2            4.4

We found that the total time required for the distance calculations,
including categorization, was about thirteen times shorter in the case
using k-means (4.4 s against 56.2 s).

Parallel Processing

For CDP matching, we calculate the local distance d(t, τ) between input
frame t and standard-pattern frame τ, for all of the standard-patterns.
Then, using these results, we progressively calculate the cumulative
distance S(t, τ) for each pattern, as follows:

$$S(-1,\tau) = S(0,\tau) = \infty \quad (1 \le \tau \le T) \tag{1}$$

$$S(t,1) = 3\,d(t,1) \tag{2}$$

$$S(t,2) = \min\left\{\begin{array}{l}
S(t-2,1) + 2\,d(t-1,2) + d(t,2) \\
S(t-1,1) + 3\,d(t,2) \\
S(t,1) + 3\,d(t,2)
\end{array}\right. \tag{3}$$

$$S(t,\tau) = \min\left\{\begin{array}{l}
S(t-2,\tau-1) + 2\,d(t-1,\tau) + d(t,\tau) \\
S(t-1,\tau-1) + 3\,d(t,\tau) \\
S(t-1,\tau-2) + 3\,d(t,\tau-1) + 3\,d(t,\tau)
\end{array}\right. \quad (3 \le \tau \le T) \tag{4}$$
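The recurrence in Eqs. (1) to (4) can be written out directly as a sketch. The function name `cdp_cumulative` and the array layout are assumptions made for illustration; the zero padding for d(0, .) is only a convenience, because every term that reads it is added to an infinite S and so never wins the minimum.

```python
import numpy as np

def cdp_cumulative(d):
    """d[t-1, k] holds d(t, k+1) for t = 1..N; returns S with the same layout."""
    n, T = d.shape
    # dpad[t + 1] = d(t, .); the two zero rows stand in for t = -1 and t = 0.
    dpad = np.vstack([np.zeros((2, T)), np.asarray(d, dtype=float)])
    # S[t + 1] = S(t, .); rows 0 and 1 are the infinite initial values of Eq. (1).
    S = np.full((n + 2, T), np.inf)
    for t in range(1, n + 1):
        i = t + 1
        S[i, 0] = 3 * dpad[i, 0]                                   # Eq. (2)
        if T > 1:
            S[i, 1] = min(S[i - 2, 0] + 2 * dpad[i - 1, 1] + dpad[i, 1],
                          S[i - 1, 0] + 3 * dpad[i, 1],
                          S[i, 0] + 3 * dpad[i, 1])                # Eq. (3)
        for k in range(2, T):                                      # pattern frame k + 1 >= 3
            S[i, k] = min(S[i - 2, k - 1] + 2 * dpad[i - 1, k] + dpad[i, k],
                          S[i - 1, k - 1] + 3 * dpad[i, k],
                          S[i - 1, k - 2] + 3 * dpad[i, k - 1] + 3 * dpad[i, k])  # Eq. (4)
    return S[2:]
```

The last column of the result is S(t, T), the value a spotting system would inspect at every input frame. Note that the inner loop for Eq. (4) reads only rows t-1 and t-2, so its iterations do not depend on one another.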

The cumulative distance S can be calculated in parallel for each frame
of each standard-pattern. Accordingly, we converted the algorithms for
parallel processing and ran calculating-time tests, using a CRAY CS6400
as our computer. At present, the CS6400 at RWCP has 12 CPUs. Since one
of these, however, is used as the system controller, we ran our
comparison on the remaining 11 CPUs. The result showed a calculation
speed 4.2 times faster than on a 1-CPU system.
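As a rough reading of the 4.2-fold figure, the snippet below computes the parallel efficiency and the Karp-Flatt experimentally determined serial fraction. This analysis is our addition, not part of the authors' evaluation, and the function names are hypothetical; it only rearranges the two numbers reported above (a speedup of 4.2 on 11 CPUs).

```python
def parallel_efficiency(speedup, cpus):
    """Fraction of ideal linear speedup actually achieved."""
    return speedup / cpus

def karp_flatt_serial_fraction(speedup, cpus):
    """Karp-Flatt metric: e = (1/speedup - 1/cpus) / (1 - 1/cpus)."""
    return (1.0 / speedup - 1.0 / cpus) / (1.0 - 1.0 / cpus)

efficiency = parallel_efficiency(4.2, 11)            # about 0.38
serial_part = karp_flatt_serial_fraction(4.2, 11)    # about 0.16
```

Read this way, roughly a sixth of the single-CPU run time behaves as if it were serial or overhead, which gives one measure of the room left for improving the parallel implementation.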

The Real-time Gesture-spotting System

Based on the results of the experiments we have conducted up to now,
we have created a system that recognizes gestures in real time (30
times per second). The series of images coming from the CCD camera is
passed as input into an off-the-shelf general-purpose image-processing
board (IMAGING TECHNOLOGY Series 150/40). This board runs edge
extraction, spatial reduction, and temporal averaging on the images.
All of the feature-extraction parameters are the same as the values we
used in the test system. The memory on the image-processing board can
be accessed from the host workstation, a Sun SS10. The feature-images
produced in that on-board memory are then transferred via socket
communication to the CS6400, which conducts the CDP and the logarithmic
processing (Fig. 1). We have verified that our system can work in real
time with 42 standard-patterns (containing 967 frames).

Figure 1: System Configuration [diagram labels: CCD Camera, 150/40,
Temporal Differencing, Spatial Reduction, Sun SS10, Ethernet,
Cray CS6400]
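The socket transfer of feature-images mentioned above might look like the following sketch. The wire format is entirely an assumption: the paper does not describe it, so the fixed 16x16 little-endian float32 frame and the names `send_frame` and `recv_frame` are hypothetical. At this size a frame is 1024 bytes, so 30 frames per second amounts to about 30 KB/s.

```python
import socket
import numpy as np

FRAME_SHAPE = (16, 16)            # feature-image size reported in the paper
FRAME_BYTES = 16 * 16 * 4         # assumed encoding: little-endian float32

def send_frame(sock, frame):
    """Send one feature-image as a fixed-length binary record."""
    data = np.asarray(frame, dtype="<f4").tobytes()
    if len(data) != FRAME_BYTES:
        raise ValueError("unexpected feature-image size")
    sock.sendall(data)

def recv_frame(sock):
    """Receive exactly one record; recv may return fewer bytes than requested."""
    buf = bytearray()
    while len(buf) < FRAME_BYTES:
        chunk = sock.recv(FRAME_BYTES - len(buf))
        if not chunk:
            raise ConnectionError("socket closed mid-frame")
        buf.extend(chunk)
    return np.frombuffer(bytes(buf), dtype="<f4").reshape(FRAME_SHAPE)
```

A fixed-length record keeps the receiver simple: it needs no framing header, only a loop that keeps reading until a whole feature-image has arrived.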

Conclusion 

We have constructed a system with which it is possible to perform
matching with a large number of models in real time. In the future, we
plan to develop more efficient ways of representing gestures, to
construct a noise-robust system, and to improve the efficiency of our
parallel processing.